On Computing Nash Equilibrium in Stochastic Games

نویسندگان

  • VIKAS VIKRAM SINGH
  • N. HEMACHANDRA
  • MALLIKARJUNA RAO
چکیده

Using the fact that any two player discounted stochastic game with finite state and action spaces can be recast as a non-convex constrained optimization problem, where each global minima corresponds to a stationary Nash equilibrium, we present a sequential quadratic programming based algorithm that converges to a KKT point. This KKT point is an -Nash equilibrium for some > 0 and under some suitable conditions we show that this KKT point corresponds to a stationary Nash equilibrium. The algorithm updates the Hessian matrix of the Lagrangian function in a specific way. We illustrate various difficulties that can arise while computing stationary Nash equilibrium of the stochastic game using a variant of pollution tax model. One interesting feature of this model (in an instance) is that it admits a Nash equilibrium which is independent of the discount factor close to 1, an extension of Blackwell optimality in Markov decision processes.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Nash Equilibrium in Graphical Games and Stochastic Graphical Games

In this paper, graphical games, a compact graphical representation for multi-player game theory is introduced. Various Nash equilibrium computing algorithms for graphical games are reviewed and a naive join tree base approximate Nash equilibrium computing algorithm is proposed. The proposed algorithm can be efficiently used in graphical games with arbitrary underlying structure. Graphical repre...

متن کامل

Nash Equilibrium Strategy for Bi-matrix Games with L-R Fuzzy Payoffs

In this paper, bi-matrix games are investigated based on L-R fuzzy variables. Also, based on the fuzzy max order several models in non-symmetrical L-R fuzzy environment is constructed and the existence condition of Nash equilibrium strategies of the fuzzy bi-matrix games is proposed. At last, based on the Nash equilibrium of crisp parametric bi-matrix games, we obtain the Pareto and weak Pareto...

متن کامل

Computing Equilibria in Multiplayer Stochastic Games of Imperfect Information

Computing a Nash equilibrium in multiplayer stochastic games is a notoriously difficult problem. Prior algorithms have been proven to converge in extremely limited settings and have only been tested on small problems. In contrast, we recently presented an algorithm for computing approximate jam/fold equilibrium strategies in a three-player nolimit Texas hold’em tournament—a very large realworld...

متن کامل

Fast Planning in Stochastic Games

Stochastic games generalize Markov decision processes (MDPs) to a multiagent setting by allowing the state transitions to depend jointly on all player actions, and having rewards determined by multiplayer matrix games at each state. We consider the problem of computing Nash equilibria in stochastic games, the analogue of planning in MDPs. We begin by providing a generalization of nite-horizon v...

متن کامل

Computing Nash Equilibria for Stochastic Games

We present two new algorithms for computing Nash equilibria of stochastic games. One is a global random start algorithm based on nonlinear programming, while the other combines a dynamical system with nonlinear programming to find stable equilibria. Promising numerical results are presented.

متن کامل

The Stochastic Response Dynamic: A New Approach to Learning and Computing Equilibrium in Continuous Games

This paper describes a novel approach to both learning and computing Nash equilibrium in continuous games that is especially useful for analyzing structural game theoretic models of imperfect competition and oligopoly. We depart from the methods and assumptions of the traditional “Calculus” approach to computing equilibrium. Instead, we use a new and natural interpretation of games as condition...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012